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I* Introduction 

For the past- few months X hav« bean studying the probl* m 
of how to make a computer understand linguistic lnforration (in 
some generally accepted sense of "understand **)• 1 ha^e listened 

to courses on Linguisti c Structure (Dr. Chomsky) and Me ch anical 
Tran slat io n (Cr. 7ngve), and read the works of various linguists, 
logicians, psychologists, and computer programmers on sublets 
ranging through semantics, information retrieval, and mechanical 
translation. 

The remainder of this paper is dividsd into two parts: 
a survey of various ideas and results appearing in the current 
literature (with some editorial corsuent); ar.d a proposal for future 
work to include a computer system for storing and extracting semantic 
information. 

II. Background literature 
A, Ge neral Seman tics 

’Semantics'■ is generally studied from ono of two vlewp xt/iM?: 
purr or descriptive- Pure semantics, the kind studied by Gamao. 1 '* 
deals with the properties of artificially-constructed formal systems 

s\ • 

(which may or may not have analogues in the real world), with respect 
to rule 3 for sentence formation and designation of formal models and 

- * * f * 

truth values. We shall rather be concerned with deaci Iptlve semantics, 
an ermplrical search for ruiee governing truth and meardngfulnesB of 

1 m 

sentences in natural language. 

We quickly encounter the paradox of having to use words 
with which to discuss the moaning of words. In fact even of the 
word "meaning”. Any attempt to distinguish between object language 
and met a-language seems strained and artificial. A coan.on device 
is to define "meaning" with a very specialized sense, or refuse that 
it can be defined altogether. Quine, tongus in cheek, recognizes 
this difficulty in the following paragraph “ 

'One must remember that an expression's meaning (if we are 

’j|p Svr.b.'Mrlptri refer to Bibliography entries 
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to CtfSirtit Mich things as lae&ninSfl) ia ?10" bo bs confuted with th# 

03 j-fStij 1J any, th&t- the expression d&Bignates^ Sentences do not 

cl sei gr.au-> %% all---, though words In them ir. : -iys sar^new iti'S s:J w&ly 

not singular tom*., Cut sentences sM.J.1 fcavfl meanings (if we adjust 

s job things ^b uaanlnss j;~ and the Heading of ari ofcarrtal sentence 

is tfc-> objocL designated by the singular term .round by bracketing the 

B'.untaiioo. "ShAt singul^i 11 terra will fcwvs a m sailing. in tmT) {if we 

5 . , T e prodigal fci igh with meaning^ ) f but It Till prCEauiiii bly bi? 

something furfcEvr* (Tnder this apprc&nb the maaziing (if such fch.ura 

t?) of ths noti-items! sentence Tha docz- h open 1 ' 1st not & 

p ?apon 1,tior.. * □ ^ —■implying that the illbai ;e tp -alining of r '!bc doo:-’ 

iii opsn f p ia sock. consists nituililt* aafc of e ir cjidb(; anc. qe. aurouidlnj 

a particular occasion on whfeh t-Vio Btatblmetit '^Iig door ia oppn t fl wae 

uttered.. Clair ly this kind of concept does not lend i iso If to 

coluber usage.. 

■a 

Z'iS? in; nor-. precise in making the following diE tine Won: 
uiorda rcn,y have mealing (but not significance)! utterance*. (phraseu , 
sentences) raay Jmvt aignifioaisce (but not- meaning) ,. However he 
sbatss bh-at an anal^is of the significance of a whole utterance 
tirtoot be consisted uJfhoub an analysis of -the meanings of bhe words 
i.c the utterance.. 

ji 

Another simi3»ir approach is that of Uilararm who considers a 
word as t-b L i sraalluiifc signifioant unit with Isolated “content" 

Wiwre.’ie phrases, sentences, etc.., astpress reliiSbions between the 
things which ar > symbolized by individual i/ords* Here Waning 6 is 
difined as r, a reciprocal relationship between tbs n&cic and fehs scsise, 
which enables the on* no call up tSis other * n By “sense" Is Ltflftiit 
trib thought or i of or once to an object or association (referent) which 
i s represented !y the word (symbol} ,. Nut-e that meaning: hers relates 
word with thought about object* not neons airily with object ilocl 
in this, com:-action let us mention Walpole 1 a definition 

routsa* Ha obuerves that s word tsay b^ defined by direct 
eynltjolinasion, associating it with an observable object but alee 

any othtir kind of association, connection, or characteristic, !?ueb 
as location or nfcate or legal re^^ation, can bo u&edi and the precise 
connection is generally d^gcrlbad verbally* Walpole also notes fand 
wo shall us 1 ? tbiio fact tat-^r) that soutsit isord rclationahipE t nuch 

v/holt to par^ or gercrCLiE&tion to ap jclal oastfjjdeiflnnlng par^lkl 
o:7doFin^B of large claefi, j s of words {agpstcially to^nmon norm} into 



tr©s structures. However the cl add of abstract; nouns or '’fiction®, K 
which flo hot nacitfc any ohjest in s.ny specific s ars c-Expert sites, d*i- 
net lend tVhsiHs c1v*je to such ordering 
h-, _ and jto^riing _ 

•JUliua far vie havn Alec 11*0 jd meahlUE tthl Its- Ignoring thf> 
gramoar Syntax) of language —■ but cleanly granxuar must b& 
r: omsIdK.rred 3j.n1'. e an ungrasHEtitAciil sent one e 1.3 not likely to he 
\*:oy meaningful * 

A ^grammar" is usually dt'finfed a 3 a a at? of rules Ton 
generating tha sraunnotical Bentfsho-js of a ],E.^uf^-5, and none of 
th.- umgrtjmstical once* B*rivfrg. a grammar is ^n empirical buslRfcsn, 
(listen the) ultimate test of whether a statement Is grammatical or 
rofc in to ask a rjativr speaker* Considering only the 'uncfclonal 
rolep cf wo prig :Ln e^manccE (their l Darfca-o.f“ speech' 1 ^ tat not 
thsir n]ie.:iifi*3 .In any secbe, Chciraly ' cievelopB various rcodulB ?os 
Iftigliah granular: phrast structure la a simple concept .nd works for 1 
a small part of th* language, but la gen really inadequate; trs-ms- 
t. anG&tlorial schemes art- probably ad-squat r, but aro complicated Sid 
difficult to conpl?tc or test* 

Although syntactics purports to ignore meanings., tb?? 
tcuijdary line between grammar a nd sonant Ice is quite h izy. Fcr 
example, omra linguists classify che go-called *Wob muns^erS.* Vafcer *) 
as a separate grammatical group since they don^t take the article; 
how*ver the distinct.ion between |: 1 want meat 14 and *1 w,..nt a steak* 
la ^j;n^r?i:.l?/ considered to bE a semantic one.. 

Ziff d^finsa tiKaningfullneae- In tiorsfl of rigidity of gr&sBft&ticel 
structure, Words which are h*ce 3 &£u T y in a particular grammatical 
configuration, such as fre^dshfe occurrences of l; to, f tJ do, * H tsVi e c etc *, 
arc said to h&vt no meanings On the ocher hand words v.hich could 

bs replaced by 0 I range- nii&ibsr of aid #rn&tlv^s within a ^liren 

T 

graiEm&tical context are considered very meaning? ul . t Simmons raakijs 
this distinction between func tion words and cont ent words even mere 
sfr&rp, as we shall sse later a ii-e shall accept thcas notions oLiIy to 
tnc extent that they do indicate a close conn set ion between grammatical 
a.nd semantic classifications* 

V % Eilatlnfi Com put er Programs 

Sovoral computer programs i.ava been written lr. the past far-- 
y ..'ars which artt relevant to the- prfEc:nt disc of s ion * Soke of thi- 
features of these are suLiLwariz-ad tcl^u 



*, 1) POdraan'e analysis for si triple EngliBbn (B) Tftie program* 

designed to tranel&ta elsnple English commands inti tool in* 
machine instructions, performs the mcesairy rjyntactin analysis 
on a carts.la class of simple sentences which arft limited to a biivlII, 
prescribed vocabulary - Vbs&bulary 1 terns zm Identified with part 
o?-ap'joch, sew? certain are oar iced as ‘'bay 1 v rov'd a w 

2) Phillips 1 "Question*Answering Rotrtinu" 3 Again a dictionary 
of all tisc'd^d words and thsSr parts of speech is provided. 

Sentences are analyze on the basis of a phrase‘'Structure grartraar, 
ard placed in a canonical tabular form)* Itm lueatlon is ae suited 
to involve only ■■ r:rtain factors , and is transformed to matched 
against the sentences in the text,, until fch>? approprlat s information 

lfi found* 

3) Simmons 1 grairorafcical coding** 0 By ff.ikihj proper note of 

c fonctdon * words fpropositions, auxilliavy verbs t nonJunctions T etc,) 
which art- trays to graim^ticfll Rtructure, suffices, and eiiipirical 
miles of grammatical nontext iallowable pairs and triplets of 
syntactic formsJ, this program can scan arbitrary text at a high 
Tate of spar’d and tag each word with it s appropriate ptrt of speech, 
tfith a high degree of accuracyu 

H) S"fWTBftX^ This system, design ad to ff synthesise human 
language behavior, h; starts by so inning arbitrary text, performing 
tie granraatical coding described in 3) above and indexing all 
oocurrettcen of content (non*function) words, when a question Ib 
read, its content words are identified and rn levant a sc. t ions of texv; 
exfcr&ct^d by ue □ of the index* An answer la thou oosipoavd baaed on 
a matching proet-BB similar to Phillips', although probably 
somewhat more elaborate* 

5 ) ,^ E■aEfibsLlI"'■ , '' This program, written in the IFL'-V programing 
language, is dr;Bi^n^d to ana war any retie cmble verbal English question 
about the results of a eat of baseball games.! lha data is a certain 
tr#r> structure containing a]1 the information about all games, 
Q.u^Btlons are analyzed syntactically with bh& aid of a dictionary, 
and the resulting fontJB completed by references to the data atructuus. 
labulat^d aLisw^r lnfoMtation is than printed out* Tht dictionary 
ban a. act of values of certain attributes for each word, s^ch as 
part of speech, whether part of an idiom, and Vanning \ r 'Hi=!anlng, * 
wh^nEi only appsariii for certain word?, is canonical trane-lation 
within the context of the program* e^gu the -veanlng of "who* is 

' f $eai»# J ¥ n This procedure is adequate for- the prcbj.tra, but would be 



difficult to gun*rails® .for implication to wider contexts 

6 ) table's Bsmsntic strua times <. £aMe tm-sitva^i 9 u 9 &%- 
iacluiiion Tea for storing Itsraa in a United taahiilcal voe&tulbjya 
'4?he infonriotlah r^fcrtevsl problem le Finpl l£l*:d by usir^r. £ e Ir^l ft 
saheraj for coding the location of on the tr-w by level & nd 

node count, ami information about related ifc-aits c.*n h>= obtained 
from their tree locationsOur proposal below wUl involve a 
gangsM). ap; licatlon of similar s&raanfcla trnfl-jstrucfcuru principle* 

It 3 s inters ting to note that all the above programs arc 
baned on algorithaiic aymfcacfeitt: achegitB* While it is presently 
generally recognizee tlv*t of B-enantic notions will le er-auntlul 
fo- diffic It langtii*£a-£ recessing problorae, useful apoclflo 
ptHigram&bXe achttirge IhwoG on aemtibtic Ideas have thug far bean too 
■ftlwive. Ite currant state of this ptiass of the problem is discussed 
in the following auction. 

' £* . . Ag p-11 c a 13. ona _of J tem n tll ft» . 

Ehup^aeecity for hiving eemanUe information available h>o 
become most apparent j.n r^ebt work cn nseehanic&l translation of 
hatur&l language, tlctionfery look-up and syntactic context aoheoe^ 
hava failed miflfrrabjy, and the only h«lp a bilingual bureau c^n glv.? 
«h>n aakM his translation method ia to assert that he reads in on* 
laigu^e, ’understate;" what wfiS read, and then rs-OTjjrwases the ei©s 
1 " In the new language. It la apparent that the aaiaa process ,-33 
involve in hua^n reading aosiprahiHislon and qii<tetion-&n£;tt«ri.ng 
procedures and arc Therefore probably the icoy tc efficient 
language procusBlng by machine. 

to ordar to understand 'Wderefcandiiig, '■ we should look at what 
la involved in learning r^w words, Qiuine^ describes fchrw basic 
approaches! l)pointlng In isolation ^stimulus meaning")f 2) from 
context {pdrfornihg fchti necefiflary inductive Uferenoe^ 3) by 
description (definition — In terras, or othvr words), 1 feel that 

while the first la the most fundamental* the thirl approach Id by 
far the racket important in building a vocabulary and recalling 
Meanings d r ZXt£^ dvflnos semantic relations us »oorrelations 

between types of or a type of event and a state of affairs,, or 

i type of word and a type of thing, cr- a word and a thing and so 
forth, - 1 ftewevar since fcypea and afeateo are generally associated lrlfch 
sp'..:Cr.L,flc wordsj ail of the above except the last ady he consld^rid ,-33 



la a system, 'Th&y inter into all kinds of groupings held 

together by a conplex, unstable end highly subjective network of 
association?: associations b’Stwfr.m the n&h'ea and the senses, 
associations baaed on similarity or somo othea relation. It is by 
their effoots that these associative connections make themselves 
f ait - f , , .The sum total of tbess associative networks is the 

™i 

y^gattilsrjE L L 

Ora way to deal with the problem of semantics night be to 

avoid it fry translating ordinary language into a formal syctfrm which 

could be handled syntactically* ^ Unfortunately this procedure, if 

possible at all > would obscure the real problems in a i.-^sa of ctetailed 

documentation and notation, and fre of little general v^luo,, At first 

view Freuienbhal, * 3 IIWC 03 may seeiii like a formal system for 

describing human behaviorf hut it is actually quite far from such, 

and assumes far greater abilities for Inductive inference of rules 

and situations on the part of the receiver than is expected of the 

usual language students 
IS 

Quill Ian i.s attempting tc represent the semantic conttmt of 
aords ae seta of ^concepts ,r r which could be combined to represent the 
tir^aninga of phrases and sentences , With the b&aic premise f-haf: 
learning a nen word involves mouPuring ita values on & sot of basic 
scales, ho is trying to build up & repertoire of 3uibatla coordinate 
scales (which are generally intuitive, unldircensicmal coorainjLtss: 
eg* length, bine, hue, etc*}, and coda the corresponding repr^aant&tions 
of English words. He also permits defining words in taros of 
predefined words as coordinates. Same of this work Is bcih 3 programed 
for the computer in the CGMIT system* My fueling is that the 
relations between, words 1 b more Important than the conceptual rceanliog 
of iTcdiyioual words In fc^rma of something store basic (* 3 Burning each 
can be sjitgbly found )i and therefore a nircpler approach which Ignoree 
teas a TE&anings might be more imctediat'ely fruitful, 

"T fj. 

Sotrmors is more specifically concerned with pensile ti able word 
combinations u Kc first describes & hierarchy of sentence types: 

1 Ungroonatlcal; 2) GrariJiBfttlc&l but noraermaj 3) Sensible but fslsci 
4 ;- True, Jte then argues that tte> crucial semantic distinction 11*:$, 
between the grsnwblcal declarative sentences which arc nonaanse, 
and those which are significant {but may be true or false), £ny pair 
of tconadic prtdicatan P^ # are said bo have a sense v.aua C*£f] *F~J 
if there exi&bB any significant sentence conjoining th*n» Otherwise 



bhw have v^Iuke^LA.. Ths relation in gymbotrlo and in 

pr 5s<,?rv6cl undt?r the usual logical upn-vat Iour cn if;a arEUi)in ; :t, btrfe in 
not tranuihive a A etransfer uuiiofi- relation Q P is fcrut if "of;what 

1 : 2 }?,. it o-ain b* significantly said that it la Q.. * r ^ pr:fro*> 

ciil lister, Q, = quick 1 ** 'Shis pcrraifca the arrangement of thes** 

F, ttomd:ic predicates' 1 into a simple fcreu, ttoftre a-li words in the? 

ga:rtr ran ratting class ?>.,g., all colors, or ail vords describing wdg.it) 

cc rupy the E -ira^j nodisi* 31y main objection bo this work it; in inhere 

th j Imporb^nt distlrLcSiosia- lie* SoicinjjL'E would Argue thfit n th<s idox 

16 alwtuy? is noils Ans*5p but a ths &lrj Is alnaya gi^!ii' r ±s 

'i-i lElbie (sines sky may Jw"£ colov i .for 3 tty is blui * and bbei'o.forv 

"t^e ‘sky lo not bInv F ur«* significant ) t slyho&gh false* tfote Chat 

“ideas canoe b hs grssri ttould b*? consider Ed ncr^mse* r&thur than 

true* by Seimnrs u 1 feel the distinction beacon f nonsenss" arid 

Vans Ibis but not true of the real world L to be too hafcy to tho 

basin .far ^ stfnntic system 'such as that b&v*d on the U ;,nd +-relations )* 

17 

A book by Nida' p ttiecdsaas snvar&l types of possible word 
associations* Chain associations r Une.ir ordering of terras 

each as mineral? or colors of the (spectrum, have very limited 
applicability* flSerarchical ordering according to generality 
(cies^ inclusion) 1: ssspatially one eah-ems which we shall adopt., 
wl modifications* 1 dor* 1 1 believe as Kida dona t.hab pronouns havs 
a place in the hi@rit.rchy e.g very general turn* Esther, hh-oy should 
bs? replaced by bhoir antecedents* Constituent analyeiH is a schera? 
similar bo quillisns, and again requires Wis difficult choice of 
coordinates and assignment of values* It ;ia difficult to distinguish 
between Jfida ; 3 "four basic types of fleroea as fundamental ocii^omiyje 
il'i s^iBanbic abrochure' and curb&in well-known gremtB^fcieel parts of 
speech”except that the rul^s for- Hecuaiitlc na.'sing of aent«nctiS are 
iMliuh Mscrr^- obscure* 

XIX ^ Proposal 

Let ue novr review fcha major point? laat&bllshed In the pr j L?osdlng 
Paragraphs: 

1 } Sohea»ee baaed on fcha syntactic analy&is of Englidh language have 
b&.-^n stieaeat fully progr^Jtv-.sd to solva certain 3 intlt$d Unguis tic 
problems* 

S) yuch arhemos sr^ not adequate, in thRraaelv$s r for any larger, 
more general probl^^ssi. 

3) ItoKwer, syntactic tiTU:.lysia fgrafiffia?) siusnst be an l tr.jiortanti 
pa^t of anj. more general method* 



4) Understanding seems to He busod largely on various 
associations bebtram worU, 

5 } of hhss a^iaatitilo e-tValyoiB ^chasrisij proposed til jib 

c-in be realizable on a amputer in the near future, largely became 
taay Involve procedure* which *r* too vague *nd astiera^ in comaatiw. 
with problem which are; too large* 

Mjr fadings with regard to point 5i) above ^re h&e^d on the 
following heuristics '‘■Large problems are scalar to solve if tft& 
Solutions to smaller, similar ones are available u Hero "similar r? 

±n ug(?d In its brotderit most ambiguous soha-e. It 1 wt * av<s lucky the 
ssaaU problem ffllU bum out to fe>i & aubpa-obltra of the largu 01 ^., or 
a apsci*l c ^e with obvious rout^a to gpfwjralip.abioji (ulShoL^a this 
rcigsht not apparent at fehr? outset J j how^vsi- hh ■> solution will 
^.vto9l ^1-veyt Bu^.at BfJShodfi which should be triad on the large p-oble* 
(or f Juf,fc a.e Important, methods if hi eh should not- he tried} fl 

* propose fcc- d^v^lop a cotiputKr prog^ra which will h®v$ tr^ 
ability to utilize semantic 1 ^backgrouixi* information trtrile abBtfOi-ing 
queeitior.B on available t«cb material* Tox^s will limited 

tc? * cert:-in amll vocabulary and simple aentenca striaefcurea, probably 
selected trom children'« literature* Grammatical lnfoa.-aaW.on such 
ay phrase atruotur* rules 'and the p*rt-of-speech labeling available 
from Sittings program 10 will he available to the system* The 
semantic Information will by implicit in the organization of feh c > 
fE_c *3 o/:3.' y which will contain divisions ?aiid gasociati oi^g of both 
syntactic and ataman tie nature*, as described below* 

Worlds will firsts bo cl&salfiaa by part of speech* Ibuns w m 
thw be arranged ic a hierarchical structure baaed on &merauty 
clasa inclusions £th±a will probably invclvu re-entrant tret: structure)* 
Olhier Sfluoaiatloufl will ul 50 be Indicated h?X^m wordp. in different 
P^rta of the tree, each as part-thole relationships, or Just 
abstractly Vela tod 1 ’ (i.o, litely to hppsar In tehs arm* Bentefice, 
s,g„ “dog* *nd "Bow-wow K ) * "'0 aorao extent a verb tree .-sen ^leo V>v 

Bet UP, wi^ti il Hiov* ,£ branching into 1 WlU, tf ‘'hop/ 1 and ^pjn/ for 
flJiamplc B 

Conn actions can then bn made between nod-a-fi In the noun tret* 
aj'd nodes In the- verb tree, indicating, say, L ‘it makes sense to u*c 
110 ^ tclow this point as the subject {or object) of any v^rt 
bolow that eorrsflpoiMling pointV Similar structur« could be nst up 
^ tMn th * olasatw of jidjcctlves &nd adverbs, and batweon thens and the 
JVUiia vt>rba which they nn^y modify. 
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This semantic dictionary will bo prop-ired In advance ;m 1 
fevallat-lu ho the initial Htate of the program* It non Id craneeIv 1:-1 r 
1?* ufcar. 5 - then to generate random Eit^liBh euntondes, all of which _k ay 
t© fame gf ths.raal world* Howsver^ the (|ucstiots-sinaw:tf'Ing proc.- dura 
would Follows 

11 As the feixt 1 b read, a ^fehresyi 1 is inserted into the 
dictionary which fillers the actual evelite oi" the text 

2,. iff ^Imposaiblo '' word relations or iiftiJ vocabulary .app-wr,, 
thrs program will eonsplal.n* 

3) When a q inis lion Je RSlrccip thf? woi'dh in th© q-watlorij, the 
test thread, and fch-a structure off the dictionary are a.'I avalist He 
to be used while oonposlng en answer* 

Aiuidguous words* oblique nw?fintn£flj sm3 abstract come^t^ will 
to avoided or Ignoroii. The purpcatJEi of this at^y will be 

1] to deter-alne whether it In pogait 1 q to store a significant 
Amount of - Amantla information* by (tueana ef the typo nf a true out e 
ciBEcrlbad, l.n a re .isonable aBoiant of spann^ 

2} To defctisaine fcha nature of the search and analysis p.coeed-.trea 
which nr-'a n$odgd to utlLite most iffioiently this avaiJ.sbl^ 

Bemantie Information* 

.i shall soon start prepari]^ detailed flow-chart; a for fcflft 
program 1 would rippreclate ftearittfj whatever i da->{; on aogjtUctlona 
gnyon^i wight havc a 
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